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ABSTRACT 

We present a catalogue of galaxy groups and clusters selected using a fricnds-of-friends 
algorithm with a dynamic linking length from the 2dF-SDSS and QSO (2SLAQ) lu- 
minous red galaxy survey. The linking parameters for the code are chosen through 
an analysis of simulated 2SLAQ haloes. The resulting catalogue includes 313 clusters 
containing 1,152 galaxies. The galaxy groups and clusters have an average velocity dis- 
persion of CT„ = 467.97 kms~^ and an average size of Rdt = 0.78 Mpc h~^. Galaxies 
from regions of one square degree and centred on the galaxy clusters were downloaded 
from the Sloan Digital Sky Survey Data Release 6 (SDSS DR6). Investigating the 
photometric redshifts and cluster red-sequence of these galaxies shows that the galaxy 
clusters detected with the FoF algorithm are reliable out to z~0.6. We estimate masses 
for the clusters using their velocity dispersions. These mass estimates are shown to be 
consistent with 2SLAQ mock halo masses. Further analysis of the simulation haloes 
shows that clipping out low richness groups with large radii improves the purity of 
catalogue from 52% to 88%, while retaining a completeness of 94%. Finally, we test the 
two-point correlation function of our cluster catalogue. We find a best-fitting power 
law model , ^(r) = (r/ro)'^ , with parameters rg = 24±4 Mpc and 7 ^ —2.1 ±0.2, 
which are in agreement with other low redshift cluster samples and consistent with a 
ACDM universe. 
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1 INTRODUCTION 

Galaxy clusters are the largest bound objects in the Uni- 
verse and are important structures for examining the distri- 
bution of matter and how this evolves with time. Investigat- 
ing how the mass-function of galaxy clusters changes with 
time provides an effective method for 
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The first galaxy cluster was unknowingly detected by 
Charles Messier, who recorded the positions of 11 nebu- 
lae in the Virgo cluster in the 18"^ century. Later evidence 
for the existence of clusters of galaxies was provided in the 
work of Harlow Shaple y and Adelaide Ames in the 1930s 
l|Shaplev fc Ame^l 19321) . They severely undermined the idea 
that galaxies are randomly distributed throughout the Uni- 
verse. Probably the most significant figure in the pioneering 
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of galaxy cluster detection was George Abell (|Abel]||l958l ). 
Abell surveyed around three quarters of the sky using pho- 
tographic plates, which meant that he had to identify the 
locations of galaxy overdensities by eye. To avoid including 
field galaxies, Abell defined galaxy clusters as regions 1.5 
Mpc h^^ in radius (the Abell Radius) containing fifty or 
more galaxies that are no more than two magnitudes fainter 
than the third brightest member of the group. Remarkably, 
later studies have confirmed a large number of Abell's clus- 
ters as being genuine bound structures. Due to the simplicity 
of this two-dimensional approach, however, serious problems 
can arise from e.g. inhomogeneities, p h otom etric errors and 
projection effects. iBahcall fc Soneir 3 (| 19831 ) found an ex- 
cess of powe r in the angular correlation function using Abell 
clusters and Ivan Haarlem et al.l (|l997l ) show, using N-body 
simulations, that ^ 1/3 of Abell's clusters have incorrect 
richness estimates and ~ 1/3 of Abell richness class 1 
clusters are missed. Around thirty years after Abell's work 
hybrid photo-digital surveys were able to improve upon the 
photographic plate method for detecting clusters of galaxies 
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l|Maddox et al.lll99(]| '). It was, however, the advent of digi- 
tal CCD surveys that brought about significant advances, a 
good example b eing surveys s uch as the SPSS (York et al] 
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the ongoing GAM A ( Driver et alJbOOsI ). Large skv surveys 



like SDSS signify a major step forward in obtaining galaxy 
data, however analysing that data can be approached and 
interpreted in many different ways. Automated algorithms 
supply a means of reducing subjectivity in the analysis of 
large data sets. Finally, the availability of increasingly de- 
tailed simulations in recent years has enabled more powerful 
tests of the completeness and reliability of cluster catalogues. 

In the last few decades many different algorithms have 
been develop ed to find galaxy clusters. The Counts m 
Cells method (|Couch et al. I I1991I : iLidman fc Peterson|[l996l ') 
looks for enhancements of galaxy surface density in a given 
area. Percolation methods group together galaxies that 
are separated by a distance on the sky less than a given 
threshold distance (lEfstathiou et al.ll 19881 : iDavis et allll985l : 
iDahon et al.ll 19971 : iRamella et al.ll2002l ). T his technique was 
origin ally applied to redshift surveys by iHuchra fc Gelleil 
(|l982h . An extended version of Huchra and Geller's friend- 
of-friends algorithm was later developed to deal with pho- 
tometric redshifts bv lBotzler" et al.l (|2004l '). The chan ges im- 
plemented in the extended friend-of-friends algorithm were 
necessary to account for the large errors associated with 
photometric redshifts (jBotzler et al.|[2"oo4 ). Matched Filter 
techniques model the spatial and luminosity distributions of 
:alaxies in a cluster t o generate a clu ster likelihood map 
Postman et"al] Il996l : fKim et all I2OO2I '). Voronoi Tessella- 
tion decomposes the galaxies in a region of space into dis- 
crete points surrounde d by cells facilitating the identifica- 
tion of overdensitie s (|Kim et al.l |2002| : iLopes et al.l l2004l : 
iRamella et al.|[200ll ). Another method is to look for a Red 
Sequ ence in the colour-magnitude relation of g alaxy clus- 
ters (|Gladders fc Yeell2000l : iKoester et all 120071 ). The Cut 
and Enhance method makes colour and colour-colour cuts 
to produce subsam ples of galaxies in different redshift ranges 
(|Goto et al.ll2002l ). A review of various methods and tech- 
niques used for t he op tical detection of galaxy clusters is 
provided by [Gal (12009 1. 

The Percolation or friend-of-friends (FoF) algorithm is 
the cluster finding method of interest for the purposes of this 
paper. The 2dF-SDSS and QSO (2SLAQ) Luminous Red 
Galaxy Survey, which is subset of photometr ically selected 
lumin ous red galcixies (LRGs) from SDSS (| Cannon et al.l 
I2OO6I ). is a good starting place for the use of the FoF algo- 
rithm because of its relatively small size and the abundance 
of both spectroscopic and photometric data available. 

This paper is the first in a series of two papers exploring 
the detection and analysis of galaxy clusters in the 2SLAQ 
catalogue and will focus on utilising the spectroscopic data 
available in the catalogue. The second paper, Farrens et al. 
(in prep) , will investigate detecting clusters using photomet- 
ric redshifts estimated from the SDSS photometry in the 
2SLAQ catalogue and compare the results with those pro- 
vided in this paper. This comparison will demonstrate the 
reliability of the FoF technique to detect clusters with pho- 
tometric redshifts on a consistent data set. Large data sets 
are required if one is to do cosmology with galaxy clusters. 
The fastest way of obtaining this data is through photomet- 
ric surveys. Upcoming surveys like the Dark Energy Survey 



(DBS), Euclid and Planck will obtain such photometric data 
for large numbers of galaxies across the sky and since it is not 
feasible to obtain spectra for all of these objects within the 
observing time frames, it is therefore necessary to develop re- 
liable methods for detecting clusters using photometric data. 
Photometric cluster catalogues made from surveys such as 
these can cover a larger volume than spectroscopic cata- 
logues and can thus be used to probe the shape of the mass 
function at higher masses. 

The following section provides some background on the 
2SLAQ survey. Section |3] describes the Huchra and Geller 
Friends-of-friends method in detail and how we implemented 
the algorithm. Section |4] provides a description of the mock 
catalogue used to calibrate the linking parameters. Section 
[5] presents the resulting groups and clusters found using the 
FoF algorithm and the analysis made on these results. Fi- 
nally, section [6] shows the 2-point correlation function of the 
clusters. We present a sample of the cluster catalogue in the 
appendix of this paper. 



2 THE 2DF-SDSS AND QSO (2SLAQ) 
LUMINOUS RED GALAXY SURVEY 

2SLAQ is a spectroscopic survey of around 15000 poten- 
tial lumino us r ed galaxies (LRG s) in the redshift range 0.45 
5? z < 0.7 ((Cannon c t all 120061 ). The target LRGs were se- 
lected photometr ically from the Sloan Digital Sky Survey 
(|York et al.ll200(il ). 2SLAQ also includes a lower resolution 
survey of faint quasi-stellar objects (QSOs) or quasars. Ob- 
servations began in March 2003 using the Two-degree Field 
instrument (2dF) on the 3.9m Anglo- Australian Telescope. 

The 2SLAQ criteria for selecting galaxies from the Sloan 
Digital Sky Survey consisted in placing limits on the magni- 
tudes, colours and star-galaxy separations of the SDSS data. 
The magnitude limits were imposed by requiring that: 



17.5 < id^v - A,< 19.8 



ifibre < 21.2 



(1) 

(2) 



where i^ev is the total magnitude based on a fit of each 
galaxy to a de Vaucouleurs profile, Ai is the extinction in the 
i-band and ifibre is the flux contained within the aperture 
of a spectroscopic flbre in the i-band. These limits enabled 
the selection of bright LRGs out to z ~ 0.8 and eliminated 
objects too diffuse to produce useful spectra. The colour cuts 
were such that: 



0.5 < p - r < 3.0 



r - i < 2.0 



C||=0.7(5 - r) + 1.2(r - i - 0.18) > l.i 



di 



(3) 
(4) 
(5) 



(6) 



where cy eliminates later-type galaxies and dx selects early- 
type galaxies at increasingly high redshift. Equations O and 
[4] ensure that only objects near the main locus of LRGs are 
selected. The star-galaxy separation criteria was: 



ipsf — imodel > 0.2 + 0.2(20.0 — idev) 



(7) 
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radiuSd, 



eV(i) 



> 0.2" 



(8) 
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where ipsf is the i-band magnitude of an isolated star fit- 
ted with a point spread function (PSF) model, imodei is the 
model j-band magnitude, which is a good proxy for the PSF, 
and radiustjev is the de Vaucouleurs radius, which is the ef- 
fective radius in the model magnitude. These limits elimi- 
nate the majority of stellar contamination. 



3 FRIENDS-OF-FRIENDS METHOD 

3.1 The Huchra and Geller Friends-of- Friends 
Algorithm 

The Huchra and Geller friends-of-friends algorithm was de- 
veloped to search for groups of galaxies in the magni- 
tude limited CFAl redshift survey iHuchra fc GeUeij|l982l : 
iHuchra et al]|l983l : iGeller fc Huchralll983l ). This technique 
is commutative and utilises only a galaxy's right ascension, 
declination and redshift to detect structure by finding galax- 
ies that are separated by a distance less than some thresh- 
old, Di, and that have a velocity difference less than some 
threshold, Vl. The threshold values are chosen depending on 
the properties of the galaxies in the catalogue. For simplic- 
ity Dl and Vl can be set as fixed values, however this may 
lead to some selection effects being ignored. iHuchra fc GeUeij 
l|l982l ) adopt a method that compensates for the variation 
in the completeness of the galaxy luminosity function as a 
function of redshift. 



3.2 Choice of Dynamic Friends-of-Friends Linlcing 

Th e friends-of-friend s algor ithm used in this paper follows 
the IHuchra fc Gelled (|l982h method in most respects. The 
only difference lies in the linking parameters, which change 
with the surface number density of galaxies at a given red- 
shift. This dynamic linking length is more appropriate to 
this sample of galaxies as is does not assume a magnitude 
limit pre-selection, which would remove large numbers of 
galaxies usable for cluster detection and prevent any clus- 
ters at the highest redshifts from being found. The choice of 
this type of linking parameter has the drawback of allowing 
for a more complex selection function that will render inter- 
pretation less straight forward. A fiuxogram of the Dynamic 
Friends-of-Friends (hereafter DFoF) algorithm is shown in 
figffl 

Following the steps shown in fig[T] First the initial 
values of linking length, Ro, redshift linking, 20, and the 
minimum number of galaxy members needed to form a 
group/cluster, 'Nmin, are chosen. To compensate for selec- 
tion effects, Ro is varied via equation [51 



RfriEnd{z) OC Ro 



dN dz 1 



(9) 



dz dV Asky , 

where z is redshift of each galaxy for which the variable 
linking length is calculated, dN{z)/dz is the surface num- 
ber density of galaxies in the redshift range covered by the 
galaxy catalogue (zmin ^ 2 ^ ^max), dV /dz is the differen- 
tial comoving volume and Asky is the fraction of sky area 
covered in the catalogue relative to the total sky area. Note 
that the limits Zmin and Zmax can be modified if one does 
not wish to use the full redshift range of the catalogue, zq is 



set initial parameters 
Ro, Zoaiid N/?(/« 



find suface number 
density p of galaxies iti 
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Rfrierid= Rwr(p) 
Vfrieml = z<iCl2(l+z) 



_L 



find galaxy I not in a 
group with: 
7.min < Zi < 7.ma 
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Dy < \Kfriend 
& 

Iv/ - y/1/2 < Wfi-iend 
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isolated 



yes 
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galaxy list. 

Add galaxies /j'to 
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search for friends of 
friends 



yes 



add friends of friends 
to group-candidate k 
list 
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yes 
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Figure 1. Fiuxo gram of the FoF a l gorith m with dynamic linking 
length. Based on lHuchra fc Gelled |l982h . 



converted into a velocity linking parameter, Vfriend{z), via 
equation 1101 



f riend (2) 



2()C 



2(1+2) 



(10) 



where c is the speed of light. The factor of 2 in the denomi- 
nator accounts for the fact that we want to be able to link 
galaxies that have velocities approaching along the line of 
sight to those that have velocities receding along the line 
of sight. vfriend{z) will therefore be representative of the 
velocity dispersions of the clusters being examined. 

A galaxy i is selected from the catalogue that has not 
yet been assigned to a group and lies in the redshift range 
Zmin ^ z ^ Zmax ■ The projccted distance between galaxy i 
and second galaxy j, Dij, is calculated using: 

Dij = cos^^ {sin{Si)sin{Sj) + cos{Si)cos[Sj)cos[ai — ctj)) 

(11) 

where a and 5 are right ascension and declination respec- 
tively. 

The two galaxies are linked together (i.e. are friends) if 
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they satisfy the conditions in equations 1121 and 1131 

Dij ^ Rfrimd(z) (12) 



end (■2') 



(13) 



A group-candidate k is formed that includes galaxy i 
and its friends. A search is then made around the galaxies 
linked to i. This process is repeated until no further friends 
are found. The group-candidate k is defined as a real group 
if it satisfies equation 1141 



> Nrr 



(14) 



For the purposes of this paper Nmin ^ 3. This is a reasonable 
assumption when dealing with LRGs, which are not common 
objects. 



4 MOCK CATALOGUE 

To determine the optimum values of Ro and zq for the DFoF 
code, a mock galaxy catalogue that simulates the 2SLAQ 
catalogue was produced. This mock catalogue contains a 
distribution of 7,651,076 dark matter haloes and 824,704 
galaxies across an octant of the sky. 

The h alo catalogue was derived from the Ho rizon 47r 
simulation (|Tevssier et al.ll2009l : [Prunet et al.ll2008f ). This is 
a ACDM dark matter iV-body simulation using WMAP 3 
cosmology with a 2h~^ Gpc periodic box on a grid of 40,963 
cells. The 7 x 10^" particles were evolve d using the Pa rti- 
cle Mesh scheme of the RAMSES code (|Tevssied [ioo^ l on 
an adaptively refined grid (AMR) with around 1.4 x 10^^ 
cells, reaching a formal resolution of 262,144 cells in each 
direction (~ 7h~^ kpc comoving). The simulation covers a 
sufficiently large volume to compute a full-sky dark mat- 
ter distribution, while resolving Milky- Way size haloes with 
more than 100 particles and exploring small scales deeply 
into the non-linear regime. The dark matter distribution in 
the simulation was integrated in a light cone out to redshift 
1, around an observer located at the centre of the simulation 
box. The underlying cosmology for WMAP 3 is: Q,m = 0.24, 

= 0.76, S^b = 0.042, n = 0.958, Ho = 73 and cts = 0.77. 

Gravitationally bound haloes of dark matter are se- 
lected using the spheri cal overdensity HOF0 method of 
lEisenstein fc HutI (liggsl ). HOP is based on a hybrid ap- 
proach in which the local density field is first obtained by 
smoothing the density field with an SPH-like kernel using 
the n nearest neighbours. Then the particles above a given 
threshold are linked with their highest density neighbours 
until, after several "hops", they are connected to the one 
particle with the highest density within the region above the 
threshold. All particles linked to the local density maximum 
are identified as a group. 

The haloes were then populated using the Halo Oc- 
cupation Distribution (HOD), where the number of galax- 
ies residing within each halo is drawn from a probability, 
P{N\M), that a dark matter halo of mass M will host 
galaxies. 

The first moment of P(N\M) is the mean number of 



^ http:/ /cmb. as. arizona.edu/~eisenste/hop/hop. html 



^ 5.0x10"^- 




Figure 2. Number density as a function of redshift for the mock 
halo catalogue (blue dashed line) and the 2SLAQ catalogue (red 
solid line). 



Table 1. T 

he HOD parameter s derived from the MegaZ-LRG sample using 
the methodology of lBlake et al ] l|2008l '). The small differences be- 
tween these parameters and those of lBlake et al. reflect 
the difference in the assumed underlying cosmology. 



Redshift slice 






/3 






^°^(^) 


0.45 <z< 0.50 


0.618 


13.88 


1.41 


12.96 


0.50 <z < 0.55 


0.469 


13.99 


1.54 


13.00 


0.55 <z< 0.60 


0.554 


14.16 


1.66 


13.23 


0.60 < 2 < 0.65 


0.675 


14.43 


1.56 


13.60 



galaxies as a function of halo mass and it is usually param- 
eterised as a sum of a central and a satellite components 
(jKravtsov et al.ll2004l : lTinkeill2007l l. 

The probability that a halo contains a central galaxy is 
given by: 

flogj^oiM/Mcnt)" 



(AfclAf) =0.5 



1 -f erf 



Ccut 



(15) 



and the number of satellite galaxies is obtained from a Pois- 
son sampling of: 



(Ns\M) = 



(16) 



Mo) 

The HOD model used is that of lBlake et all (|2008l ) and 
includes several derived parameters that were computed at 
various redshifts bins between 0.4 < z < 0.7. Table [T] lists 
the HOD parameters in each redshift bin. For the purposes 
of this paper an evolving HOD model is constructed that 
smoothly interpolates between the four iBlake et all (|2008l ) 
redshift bins as shown in figH) This fig ure shows that ou r 
smoothly evolving HOD model, based on lBlake et al.l (|2008l ). 
succeeds in reproducing the observed number density of the 
2SLAQ data. 

The radial positions of satellite galaxies within a halo 
are a ssigned according to the NFW profile (|Navarro et al.l 
Il996l ). Specifically, it is assumed that the mass inside a given 
radius, properly normalised, represents the probability of 
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containing a galaxy. Thus, integrating to the halo boundary 
would yield a probability of one and therefore all galaxies 
would be placed within this radius. The angular position rel- 
ative to the halo centre is chosen randomly for each galaxy. 

In our model, all dark matter particles are assumed to 
be in approximately spherical virialised haloes. The velocity 
of a dark matter particle is the sum of two terms. 



«vir + Whalo , 



(17) 



the first is due to the velocity of the particle about the centre 
of mass of its parent halo, and the second is due to the 
motion of the centre of mass of the parent. 

Consider the first term, Vvir- We will assume that viri- 
alised haloes are isothermal spheres, so that the distribution 
of velocities within them is Maxwellian. This is in reason- 
able agreement with measurements of virial velocities within 
haloes in numerical simulations. If a^ir denotes the rms 
speeds of particles within a halo, then the virial theorem 
requires that: 



Gm 2 H{zY ^1/3 

— OC cr„,, OC A/,^ 



3m 



47rpcrit(z) 



2/3 



(18) 



where the final proportionality comes from the fact that all 
haloes have the same density whatever their mass: m/r^ oc 
/^vir pcrit- This sliows that Gvir OC m}^^: the more massive 
haloes are expected to be 'hotter'. At fixed mass, the con- 
stant of proportionality depends on time and cosmology, and 
on the exact shape of the density p rofile of the halo. A conve- 
nient fitting formula is provided bv lBrvan fc NormanI (|l998h : 



avi.(m,z) = 102.5 g.Ai{,«(z) 



VlOi3M0//i 



1/3 



(19) 



where — 0.9, and 

Avir = IStt^ -I- 822: - 392;^, with x = D,{z) - 1 (20) 

and il{z) = [Qm (1 + ^)^] [Ho/H{z)]'^. It ha s been shown that 
a(M ) is independent of local environment (|Sheth fc Diaferiol 
[2003), however it may depend on position within a halo. This 
will mainly be due to the fact that haloes have complicated 
density and velocity profiles. 

The two-dimensional redshift-space correlation function 
for the mock catalogue, ^{a, tt), was calculated with the pair 
separation decomposed in terms of perpendicular, a, and 
parallel, tt, distances. Each component of separation was 
calculated for 20 bins equally spaced in comoving distance 
between 0.01 < ct, 7r[Mpc/i~^] < 40, creating a 400 element 
grid of measurements. The smooth contours are created by 
linear interpolation over the grid. Fig[3] shows the calcu- 
lated 2-D correla tion functions for the 2SLAQ catalogue of 
IRoss et"al] |2003) and the mock galaxy catalogue. It shows 
the expected linear squashing at large radii and redshift dis- 
tortions with 'finger of God' elongations at small scales. The 
colour map represents log(^). 

Fig[2] and |3] show that the simulation reproduces the 
properties of 2SLAQ well enough for us to believe that it is 
sufficient to aid us in calibrating our cluster finding param- 
eters and therefore any limits imposed upon the values of 
Ro and zo from the simulation will be applicable to the real 
data. The discrepancy between the real and mock 2SLAQ 
data in fig|3]at low values of vr is due to shot noise {i.e. there 
is a small number of pairs at this scale). There are, however, 



0.5 




1.5 



Figure 3. The two-dimensional rcdshift-space correlation func- 
tion for the 2SLAQ catalogue (left half) and the mock catalogue 
(right half) plotted as a function of transverse, cr, and radial, tt, 
pair separation. The colour map represents log ($). The 2SLAQ 
correlation function is that of lRoss et al. 



some caveats that should be taken into account that may 
cause the simulation to not be fully representative of reality. 
For example, the effects of things such as the way in which 
galaxies are added to dark matter haloes and cluster merg- 
ers at high redshifts could have influence on the results. In 
the following section, we discuss how the simulations can 
bias the results together with the analysis of the results. 



5 DATA ANALYSIS 

5.1 Linking Parameter Optimisation 

The DFoF code was run on the 2SLAQ simulation with dif- 
ferent values of Ro and zq to find the optimum combination 
of the two. To gain a more physical interpretation of these 
parameters, hereafter Ro and zo are expressed in terms of 
Rfriend{z) and Vfriend{z) Sit z — 0.5 uslug equatious [9] and 
1101 The parameters were varied in the ranges: 

< Rfr.end{z = 0.5) < 4.2 (Mpc /l"') (21) 

100 ^ Vfrieud{z = 0.5) < 1500 (kms^^) (22) 

The choice of parameters is made based on the complete- 
ness, purity and total number of clusters in the resulting 
catalogues. 

In order to determine the completeness and purity of 
the DFoF clusters relative to the simulation haloes, a mem- 
bership matching code was implemented. The technique in- 
volves looking at the galaxy members assigned to each clus- 
ter by the DFoF code and matching these to the original halo 
member galaxies. The clusters are examined in descending 
order of richness to ensure that the largest clusters are the 
first to be matched to mock haloes. DFoF clusters may con- 
tain contributions from several mock haloes, therefore each 
cluster is matched to the mock halo with the highest num- 
ber of shared members. Thus, if two haloes are merged into 
one cluster only one of the two haloes will be matched. 
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Figure 4. Number of clusters detected in the mock halo catalogue as a function of R/riendC^ = 0.5) and Vfriendi^ = 0.5) (top panel). 
Unique completeness (middle left panel) and purity (middle right panel) of the DFoF clusters relative to mock haloes as a function of 
Rfriendi^ = 0.5) and i'/rienii(2 = 0.5) . Non-unique completeness (bottom left panel) and purity (bottom right panel) of the DFoF 
clusters relative to mock haloes as a function of Rfriendi^ = 0.5) and vj^i^ndi^ = 0.5). 



For the purposes of this paper we examine two match- 
ing scenarios: a) a strict unique- matching regime, b) a less 
strict multiple-matching regime. In the first case each clus- 
ter is uniquely matched to one halo and any cluster that 
corresponds to a halo that has already been matched, which 
will be of equal or lesser richness, will be ignored. In the lat- 
ter case multiple clusters are allowed to match to the same 
halo, which increases the completeness. The completeness 
and purity are defined via equations 1231 and 1241 



Completeness = 



Nhaloe 



(23) 



Purity : 



Nrr 



(24) 



Where Nmatchea is the total number of unique or multi- 
ple matches, Nduatcrs is the total number of DFoF clus- 
ters found and Nhaioes is the total number of mock haloes. 
Nhaioes is calculated ignoring all mock haloes with less than 
3 galaxy members as this is the detection limit of the DFoF 
code. 

FigU shows the total number of clusters found in the 
2SLAQ mock (top panel), unique completeness (middle left 
panel), unique purity (middle right panel), non-unique com- 
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Figure 5. Completeness (left panel) and purity (right panel) as a function of Rfriendi^ = 0.5) for a fixed value of f/rien[i(z = 0.5) = 900 
kms~^. The red solid line shows clusters with Nmem ^ 3, the blue dashed line shows clusters with Nmem ^ 4, the light blue dot-dashed 
line shows clusters Nmem ^ 5 and the green triple dot-dashed line shows clusters with Nmem ^ 6. 
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Figure 6. Number of clusters detected in the 2SLAQ catalogue as a function of Rfriendi^ = 0.5) and i'/riend(2; = 0.5). 



pleteness (bottom left panel) and non-unique purity (bot- 
tom right panel) as a function of Rfriend{z = 0.5) and 
Vfriend{z — 0.5). TMs figure clearly shows that the DFoF 
results are primarily dependent on the choice of Rfriend{z = 
0.5). Looking at the overall contour shape in the top panel, 
one can see that the number of clusters found peaks around 

Rfriendiz = 0.5) = 2.8 Mpc and Vfriendiz = 0.5) = 

900 kms~^ after which clusters are merged together creat- 
ing highly unphysical structures. This sets an upper limit 
on both of the parameters, although we would intuitively 
expect the value of Rfriend{z = 0.5) to be much lower. For 
Vfriend{z = 0.5) > 400 kms~^, the catalogue is fully com- 
plete in the range 0.35 < Rfriend{z — 0.5) < 0.87 Mpc h^^ 
and fully pure out to Rfriend{z = 0.5) = 0.28 Mpc for 
both the unique and non-unique regimes. 

Since Rfriend{z ~ 0.5) has a greater effect on the com- 
pleteness and purity, we can fix Vfriend{z = 0.5) to the limit 
imposed by the top panel of figUJand examine how the com- 
pleteness and purity vary with just Rfriend{z = 0.5). Fig[5] 
shows the variations in completeness and purity as a func- 
tion of Rfriend{z = 0.5) for different richness cuts using a 
fixed value of Vfriend{z ~ 0.5) = 900 kms~^. The richness 
cuts show what the completeness and purity would look like 



if we remove groups that have fewer members than some 
given threshold. It should be noted that the value of Nhaioes 
is unchanged and therefore haloes which have fewer mem- 
bers than the richness cut threshold are no longer matched. 
This figure indicates that the richer clusters are more pure, 
but less complete, as one would expect. Because it is not 
possible to produce a catalogue that is 100% complete and 
100% pure, it is necessary to choose one or the other, or some 
compromise between the two requirements. If we choose a 
catalogue that is ~ 100% complete, fig[5] shows it can be cut 
by richness to improve the purity. 

Finally, the DFoF code was run on the real 2SLAQ 
catalogue to investigate the number of clusters found for a 
given set of linking parameters. Fig[6] shows the total num- 
ber of clusters found in the 2SLAQ catalogue as a function of 
Rfriend{z = 0.5) and Vfriend{z — 0.5). Both this figure and 
the top panel of fig[4]have the largest A'cit contours roughly 
in the range 2.5 ^5 Rfriend{z ~ 0.5) 4.0 Mpc h^^. 

Therefore in order to obtain a fully complete catalogue 
with high purity and the largest number of clusters possible, 
based on the results in figU) [S] and [S] values of Rfriend{z = 
0.5) = 0.87 Mpc and Vfriend{z = 0.5) = 900 kms"^ were 
chosen to find groups and clusters in the 2SLAQ catalogue. 
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These values correspond to a 2SLAQ mock cluster catalogue 
that is 98% complete and 52% pure. 

5.2 Basic Results 

Running the DFoF algorithm using a linking length of 
Rfriend{z = 0.5) = 0.87 Mpc and a velocity linking 
parameter of Vfriend{z = 0.5) = 900 kms~^ on the 13,133 
galaxies in the 2SLAQ sample produced a total of 313 groups 
and clusters containing 1,152 member galaxies. Fig[7] shows 
the angular position of the centres of these groups and clus- 
ters and their distribution with respect to redshift. The clus- 
ter centres and redshifts are taken as the average values of 
each of the galaxy members. 

FigH] shows the distribution of cluster velocity disper- 
sion, (7„, as a function of redshift. Fig|9] shows the distribu- 
tion of cluster size, Kelt, as a function of redshift. Where Hdt 
is defined as the projected distance from the cluster centre 
to the farthest galaxy member. The 'x's indicate clusters 
with three members, the blue circles indicate clusters with 
between four and six members and the red squares indicate 
clusters with seven or more members. These plots illustrate 
that on average the richest clusters have the largest sizes 
and velocity dispersions as expected. 

5.3 Background Subtraction 

The DFoF groups and clusters were examined by look- 
ing for peaks in the photometric redshift distribution and 
colour-magnitude relation of SDSS galaxies around the clus- 
ter centres. Using an SQL query, regions of one square de- 
gree and centred on the FoF cluster candidates were down- 
loaded from the SDSS DR6 catalogue. These squares were 
downloaded for each of the 313 DFoF groups and clusters 
and include all SDSS galaxies with photometric redshifts. 
The properties downloaded for each galaxy were: id, ra, 
dec. It-magnitude, gr-magnitude, r-magnitude, i-magnitude, 
^-magnitude, model i-magnitude, photometric redshift, pho- 
tometric redshift error and star likelihood. In each of these 
squares all galaxies within a 1 Mpc h^^ radius of the centre 
were taken as cluster members with background, regardless 
of redshift, and all galaxies between 3 and 7 Mpc were 
used as field galaxies. Galaxies between 1 and 3 Mpc 
were ignored. These limits were chosen because, on aver- 
age, we would not expect galaxies father than 3 Mpc 
from a real cluster centre to be genuine members. Using 
the DFoF cluster redshifts, the data from the corresponding 
SDSS squares were stacked in redshift slices of Az = 0.05 
in the range 0.4 ^ z 55 0.7. The total number of groups and 
clusters with three or more and four or more members in 
each redshift slice are listed in Table-[51 

In each cluster redshift bin the stacked SDSS galax- 
ies were binned by photometric redshift. The number of 
SDSS galaxies assigned to the field was then subtracted from 
the number of SDSS galaxies assigned as cluster members 
plus background in each redshift bin. This subtraction was 
done taking into account the difference in area between the 
two regions. Fig llOl shows the number of SDSS background 
subtracted galaxies as a function of photometric redshift 
for all groups and clusters. The blue dashed line are the 
background galaxies, the green dot-dashed line are the fore- 
ground galaxies and the red solid line are the background 



Table 2. Redshift Slices 



z Range 


'Nclt with Nme„i ^ 3 


Ncli with Nmem 


0.40 s£ z s£ 0.45 
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0.55 s£ 2 ^ 0.60 


62 
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subtracted galaxies. In these plots the background subtrac- 
tion should average out to zero if there is no overdensity 
in the field. A sharp peak can clearly been seen in each of 
the plots at the cluster redshift range. This trend is reliable 
up to 2 ~ 0.6 after which the peaks cannot clearly be dis- 
tinguished. This a strong indication that the DFoF cluster 
candidates are genuine structures. The discrepancies seen af- 
ter z ~ 0.6 are due to the relatively low number of clusters 
at these redshifts plus a result of poor photometric redshift 
estimates for more distant objects in the SDSS catalogue. 

To look for the colour-magnitude relation of the groups 
and clusters, i-magnitude vs {g — i)-colour maps were made 
for both field galaxies and cluster-|-field galaxies in each 
SDSS square. The C-M maps were then stacked by clus- 
ter redshift in redshift slices as seen in Table-[2l Finally, the 
stacked field galaxy maps were subtracted from the stacked 
cluster galaxy maps taking into account the relative ar- 
eas. Fig ll II shows the background subtracted C-M diagrams 
for SDSS galaxies. The maps show a clear trend in colour- 
magnitude space that resembles a cluster red sequence. The 
cluster red sequence is an observational property whereby 
cluster galaxies are more red than field galaxies at the same 
redshift. As with the plots in fig l 101 this trend is visible out 
to 2 ~ 0.6 and similarly indicates that the structures are 
genuine. 

5.4 Mass estimates 

In order to calculate masses for our sample of groups and 
clusters, we compare mass estimates calculated using equa- 
tion[T5]with the true masses of the 2SLAQ simulation haloes. 
Masses were calculated for the DFoF catalogue produced 
from the 2SLAQ mock with linking parameters Rfriend{z = 
0.5) = 0.87 Mpc h-^ and v;„end{z = 0.5) = 900 kms"^ 
using the cluster velocity dispersions (hereafter Meat)- The 
membership matching code described in section 5.1 was then 
implemented to assign to each cluster a mock halo mass 
(hereafter Mtrue)- 

Fig |12l shows the average true cluster mass, Mtrue, as 
a function of cluster richness, N,-nem, and the average esti- 
mated mass, Mest, as a function of true mass. The error bars 
show the standard error of the mean. The left panel of this 
plot shows the expected trend between cluster richness and 
mass. The right panel compares the matched halo masses to 
those calculated from the cluster velocity dispersions accord- 
ing to eauation ll9l The deviations from the x = y trend are 
principally the result of contaminating galaxies in the rich- 
ness estimates and the small number of high mass clusters 
detected. At the low mass range clusters are composed of 
only 3 or 4 LRGs, hence their velocity dispersions are unre- 
liable and one object that is either missing of interloping in 
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Figure 7. Distribution of clusters in RA and Dec (left panel). Histogram of clusters as function of redshift (right panel). 
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Figure 8. Distribution of cluster velocity dispersion, cr„, as a function of redshift (left panel). The 'x's indicate clusters with less than 
four members, the blue circles indicate clusters with between four and seven members and the red squares indicate clusters with more 
than seven members. The gold asterisks highlight the groups and clusters that form part of the gold sample (see §5.5). Histogram of 
galaxy members as a function of cluster velocity dispersion for the 3 choices of richness as in the left hand plot (right panel). 
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Figure 9. Distribution of cluster size, Rdt, as a function of redshift (left panel). The 'x's indicate clusters with less than four members, 
the blue circles indicate clusters with between four and seven members and the red squares indicate clusters with more than seven 
members. The gold asterisks highlight the groups and clusters that form part of the gold sample (see §5.5). Histogram of galaxy members 
as a function of cluster size for the 3 choices of richness as in the left hand plot (right panel) . 
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Figure 10. Number of SDSS background subtracted galaxies as a function of piiotomctric redsliift for all groups and clusters. The blue 
dashed line are the background galaxies, the green dot-dashed line are the foreground galaxies and the red solid lino are the background 
subtracted galaxies. 



the cluster will have a large impact on the cluster velocity 
dispersion and hence the mass estimate. At the high mass 
range cluster velocity dispersions should be more reliable, 
but there are very few objects that contribute to the av- 
erage mass estimate. In addition to this, in the high mass 
range, when all the galaxies in an individual mock halo have 
been detected perfectly by the DFoF code, the cluster mass 
from the velocity dispersion tends to be overestimated by 
around 1O°'*M0. This is, however, comparable with the size 
of the error bars in the high mass range of fig |12l 

The range of cluster masses is a good match to known 
masses of massive clusters. Therefore, from this analysis we 



can assume that mass estimates made with the real 2SLAQ 
cluster velocity dispersions will be approximately represen- 
tative of their true physical masses with a large amount of 
uncertainty, particularly in the low mass range. This uncer- 
tainty could be reduced with a larger sample of clusters and 
by including all cluster galaxies, rather than just the LRGs, 
to calculate the velocity dispersions. 

5.5 Clipping 

In section 5.1 we chose to produce a catalogue that opti- 
mised the completeness, while maintaining the highest pos- 
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Figure 11. Colour-Magnitude diagrams for background subtracted SDSS galaxies. The maps show a clear trend in colour-magnitude 
space that resembles a cluster red sequence. 



sible purity. This choice resulted in a catalogue that was 
98% complete and 52 % pure. Fig. [S] shows that clipping 
groups and clusters with the lowest richnesses will improve 
the purity, but will reduce the completeness. This plot also 
shows that the purity is much higher for lower values of 
Rfriend{z = 0.5), which would correspoud to structures that 
have smaller radial sizes. Therefore, it appears that the pu- 
rity is mainly affected by the size and richness of the groups 
and clusters. 

In an attempt to improve the purity while preserving 
the completeness, several new catalogues were made by clip- 
ping out groups and clusters with sizes larger than some 
threshold for a given richness from the original 2SLAQ mock 



cluster catalogue. Table |3] shows the total number of groups 
and clusters with 3 and 4 members for various radial cuts. 
Rciip is the size threshold in Mpc above which all groups 
and clusters will be removed, Ntot is the total number of 
groups and clusters for a given KcUp, Ntme is the number of 
groups and clusters that are matched to 2SLAQ mock haloes 
for a given KcUp and the ratio Ntrue/Ntot gives a measure 
for the purity for a given KcUp. As can been seen in the 
table, when no cuts are made groups of 3 and 4 members 
are only 38% and 57% pure respectively. Above richness of 
4 all groups and clusters are above 80% pure. Therefore, the 
majority of the contamination in the cluster catalogue arises 
from these small groups. 
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Figure 12. Average true cluster mass, Mtrue, as a function of cluster richness, Nmem (left panel). Average estimated mass, Meat, as a 
function of true mass (right panel). Error bars are the standard error of the mean. 



Taking only groups and clusters with sizes less than 0.11 
Mpc for Nmem = 3 and less than 0.49 Mpc for N„iem 
= 4 produces a new catalogue which is 94% complete and 
88% pure. These values were chosen because they provide 
the best improvement to the purity with minimal change to 
the completeness. Thus, by clipping out clusters with a low 
richness (i.e. few members), but relatively large sizes, we 
can improve the purity by around 36%, while reducing the 
completeness by only 4%. 

Applying this same clipping procedure to the 'real' 
2SLAQ groups and clusters, we can separate the catalogue 
into 'gold' and 'silver' samples. Where the gold clusters are 
those that pass the chpping procedure and therefore are the 
most likely to be genuine. The silver clusters fail the clipping 
procedure and may still be genuine, but the probability is 
lower. Out of the 313 total 2SLAQ groups and clusters, 70 
are gold and the remaining 243 are silver. The gold sam- 
ple groups and clusters have an average velocity dispersion 
of ffv = 585.28 kms~^ and an average size of Rdt ~ 0.90 
Mpc , while the silver sample groups and clusters have 
an average velocity dispersion of ~ 430.47 kms~^ and 
an average size of R^u = 0.53 Mpc . The average prop- 
erties of the gold sample are larger than those of the full 
(gold-|-silver) sample as a large fraction of the small struc- 
tures have been removed. 

It should be noted that, although we are confident that 
our mock is a good representation of the 2SLAQ galaxies, 
it is difficult to interpret how well this clipping procedure 
will translate to the real 2SLAQ catalogue. Therefore we do 
not remove the silver sample clusters, rather we assign to 
them a lower likelihood of being genuine than the gold sam- 
ple clusters. The final 2SLAQ cluster catalogue contains an 
additional column that identifies each cluster as belonging 
to the gold or silver samples with 'G' or 'S' respectively. In 
fig[8] and [9] the groups and clusters that form part of the 
gold sample are highlighted with gold asterisks. 



5.6 Cluster Images 

Fig. [T3] shows SDSS DR7 optical g, r and i-band colour 
images of the regions around the DFoF detected clusters 
CL_008, CL_204, CL_122, CL_024 and CL_038. The yellow 



Table 3. Radial Clipping: RcHp is the size threshold in Mpc h^^ 
above which all groups and clusters will be removed, ^tot is the 
total number of groups and clusters for a given RcHpi ^true is 
the number of groups and clusters that are matched to 2SLAQ 
mock haloes for a given ^cUp and the ratio Ntrue/Ntot gives a 
measure for the purity for a given RcHp- 
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circles highlight the positions of the cluster member galax- 
ies, which are labeled with their individual redshifts. These 
clusters were chosen to sample serval different redshift bins 
as listed in Table [2l 

These images show that the galaxies are distributed in 
a small area on the sky and are close in redshift space. An 
important point to notice is that the distribution of cluster 
LRGs in each image varies from spherical to elongated fila- 
mentary structures. This is an advantage of the FoF method, 
which makes no prior assumptions about the shape of the 
groups and clusters. CL_204 (top right panel) in particular 
shows a rather filamentary distribution of cluster LRGs. The 
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full cluster will be comprised of many galaxy most of which 
will not be LRGs and therefore the true shape could well 
be more spherical. Other techniques that have prerequisites 
on the shape of clusters would not be able to detect this 
particular object using only the LRGs. 



6 CORRELATION FUNCTION 

The spatial distribution of galaxies and clusters of galaxies 
contains a wealth of information regarding the underlying 
cosmological model. The most widely used method in the 
literature for condensing this information is to measure the 
autocorrelation function of the positional data. For our pur- 
poses we measure the correlation function simply to compare 
the statistical spatial distribution of our cluster sample to 
other derived cluster samples. 

We calculate the two-point correlation function for the 
313 2SLAQ groups and clusters obtained previously. Using 
the 2SLAQ spectroscopic galaxy catalogue we create a ran- 
dom catalogue, which replicates the angular completeness 
on the sky, this process is handled straightforwardly in the 
Healpi4EI package of software. The radial distribution of the 
random catalogue is obtained from a smooth spline fit to 
the galaxy redshift distribution, n{z). The smoothing en- 
sures the exclusion of large scale structure e.g voids and 
filaments. 

The correlation f unction is calculated w ith the Landy 
and Szalay estimator (|Landv fc Szalavll 19931 ): 



DD - 2DR + RR 
RR 



(25) 



where DD, DR and RR are the number of pairs of points 
in the data, D, and random, R, catalogues. The number 
of pairs are calculated in 14 bins, equally separated in log 
space, from r = 5 — 90 Mpc . 

The correlation function is usually represented as a 
power law, ^(r) = {r/ro)~', thus making a comparison with 
other works quite straight forward. In the left panel of fig[T4] 
we plot the two-point correlation function of the 2SLAQ 
clusters (blue squares with error bars) and a best-fit power- 
law slope (red solid line). The error bars are estimated us- 
ing the jackknife method, which involves dividing the survey 
into A'' sections with equal area or volume. The variance and 
mean are estimated from measures of our statistic. Each 
measurement is performed on the survey with region i re- 
moved, where i — 1, N. 

The jackknife estimate of the variance is (|Luptonll 19931 ): 



(26) 



where Njk is the number of Jackknife samples used and Vi 
represents a single bin in our statistic, ^. In this analysis we 
set Njk. = 20 sample. 

The power law model is best fit with the parameters 
ro = 24 ± 4 Mpc and 7 = -2.1 ± 0.2, and the value of 
the reduced chi-squared is Xred = 0.94. 

The right panel in fig |14l shows the two-point correla- 
tion function of the mock haloes (black squares with error 



This software is available from |http : / /healpix. j pi . nasa. gov7 



bars) and a best-fit power-law slope (red solid line). The 
correlation of the mock haloes is best fit with parameters 
ro = 23.05 ± 0.72 Mpc h'^ and 7 = -1.95 ± 0.05, which 
are perfectly consistent with the parameters found for the 
2SLAQ groups and clusters. The error bars are measured 
using the jackknife method as before. 

Although the errors are much larger for the real 2SLAQ 
data, this analysis indicates that the catalogue of groups and 
clusters found within 2SLAQ using the DFoF code shows the 
correct level of clustering with respect to the mock halo cat- 
alogue. In addition, significant amounts of impurity and/or 
incompleteness in the cluster catalogue would suppress the 
correlation length as impurity adds randomly placed struc- 
ture and incompleteness removes correlated structure. Thus, 
as both the real and mock 2SLAQ data have consistent cor- 
relation lengths, this implies that the amount of incomplete- 
ness and impurity in the 2SLAQ cluster catalogue is not 
significant. 

The corr elation length of t he 2SLAQ LRG sample was 
cal culated bvlRoss et all (l2007l) . 7.45 ± 0.35 Mpc and 
bv lSawangwit et all (|2009l ). 7.5 ± 0.04 Mpc , however 
this value is expected to be larger for clusters of galaxies. 
Comparing the ro value obtained with those found using 
other low redshift cluster sa mples shows good ag reement, 
e.g ro = 26.0 ± 4.5 Mpc h'^ jBorgani et al]|l999l ). 19.4 ^ 



ro ^ 23.3 Mpc h~\ jMiller et al.ll 19991) . 18.8 ± 0.9 Mpc h'^ 



(ICoUins et al.ll2000l ). See lNicholl ([200TI ) for a good review of 
ro values for various cluster samples. The clustering length 
found here is also consi stent with that expe cted for ACDM, 
22 ^ ro ^ 27 Mpc jColberg et al.ll2000l ). 



7 CONCLUSIONS 

We have written an optical cluster finding algorithm based 
on that of lHuchra fc Gelleil (| 19821 ). Our dynamic friends-of- 
friends (DFoF) code uses a linking length that compensates 
for selection effects by changing size depending on the sur- 
face number density of galaxies at a given redshift. 

We produced a mock catalogue, which is representative 
of the 2SLAQ catalogue, usi ng the Hori z on 47 r simulation 
and the HOD prescription of iBlake et al.l (|2008l ) in order to 
determine the linking parameters, Rfriend{z) and Vfriend{z), 
in the DFoF code. The code was run with various combina- 
tions of the two parameters producing a set of distinct cat- 
alogues. We wrote a membership matching code in order to 
determine the completeness and purity of the resulting cat- 
alogues relative to the original mock haloes. Based on this 
analysis we chose values of Rfriend{z = 0.5) — 0.87 Mpc 

and Vfriend{z = 0.5) = 900 kms^^, which correspond 
to a catalogue that is 98% complete and 52% pure with the 
largest number of groups and clusters possible. Running the 
DFoF code with these values, we produced a catalogue of 313 
groups and clusters containing 1152 member galaxies. The 
galaxy groups and clusters have an average velocity disper- 
sion of CT„ = 467.97 kms~^ and an average size of Rdt ~ 0.78 
Mpc h-^. 

We tested the validity of our catalogue by obtaining 
SDSS galaxies in a Ideg^ region around each cluster centre. 
We then subtracted the background signal from all the SDSS 
galaxies within 1 Mpc of the cluster centre and stacked 
the results in bins according to cluster redshift. Each of these 
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Figure 13. SDSS g, r and i-bands colour images of field around CL_008 (top left panel), CL_204 (top riglit panel), CL_122 (middle 
left panel), CL_024 (middle right panel) and CL_038 (bottom panel). Yellow circles indicate the location of the cluster member galaxies, 
which are labeled with their individual redshifts. 




Figure 14. Left panel: Two-point correlation function for the groups and clusters found in the 2SLAQ catalogue using the DFoF code 
(blue squares with error bars) and best-fit power law slope (red solid line). Right panel: Two-point correlation function for the mock 
halo catalogue (black squares with error bars) and best-fit power law slope (red solid line), ^j^j was measured using the estimator of 
iLandv fc Szalavl diggaP . 
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bins was examined in SDSS photometric redshift space and 
colour-magnitude space. This analysis shows that our groups 
and clusters are reliable out to « ~ 0.6. The discrepancies 
beyond this range may be owing to the small number of 
groups and clusters detected at higher redshifts. Also, there 
may be some contamination effects in the histograms from 
the photometric redshift errors in the SDSS data. 

We produce mass estimates for our catalogue using the 
cluster velocity dispersions according to equation 1191 We 
tested the reliability of this approach by comparing mass es- 
timates for clusters found in the 2SLAQ mock with the true 
masses of the mock haloes to which they are matched. We 
find that the mass estimates of our mock catalogue groups 
and clusters are for the most part a good fit to the true halo 
masses, however with large error bars. The deviations seen 
are the result of contaminating galaxies in the richness esti- 
mates and the small number of high mass clusters detected. 
In general we see that the range of cluster masses is a good 
match to known masses of massive clusters. 

We analysed optical SDSS g, r and i-band colour images 
of a selection of the our clusters, which span different red- 
shifts. We observe that the galaxy members are distributed 
in a small region of space, on the sky and in redshift. We also 
see an overdensity of red galaxies around the cluster centres. 
This is strong evidence that the clusters are genuine. The 
distribution of LRGs in the clusters varies from spherical 
to elongated filamentary structures. This highlights an ad- 
vantage of the percolation method in that it makes no prior 
assumptions about the cluster shapes, which allows us to 
detect some structures that other methods may not. 

We test different clipping procedures on the 2SLAQ 
mock group and cluster sizes. This analysis indicated that 
the majority of the contamination in the cluster catalogue 
was attributed to groups with 3 or 4 members. We found 
that by clipping out clusters with Kdt > 0.11 Mpc 
for N 

mem — 3 and R^;* > 0.49 Mpc for N 

mem — ^ 

improves the purity from 52% to 88%, while reducing the 
completeness by only 4%, in the 2SLAQ mock catalogue. 
By applying this procedure to the real 2SLAQ group and 
cluster catalogue, we separated the clusters into 'gold' and 
'silver' samples. Where the gold samples consists of clus- 
ters that passed the clipping procedure and are therefore 
the most reliable and the silver sample consists of clusters 
that failed the clipping procedure and may still be genuine, 
however less reliable. Out of the 313 total 2SLAQ groups 
and clusters, 70 are gold and the remaining 243 are silver. 

Finally, we test the two-point correlation function of our 
cluster catalogue. We find a best-fitting power law model , 
^(r) — {r/roY' , with parameters ro = 24 ± 4 Mpc and 
7 = —2.1 ± 0.2. The value of the reduced chi-squared is 
Xred = 0.94. These values are consistent with those of the 
mock halo catalogue and are in good agreement with those 
in literature (.Nichol 200ll). 

Future surveys such as the Dark Energy Survey (DES), 
Euclid and Planck will images millions of galaxies across 
the whole sky. An abundance of photometric data will be 
obtained for each of these objects, however it will not be 
possible to obtain spectroscopic data for all of them. There- 
fore, it is important to develop reliable cluster finding tech- 
niques that utilise the photometric data that will be avail- 
able. Photometric redshifts, for example, provide a useful 
way to probe the properties of galaxies along the line of 



sight when spectroscopic data is not present. All of the re- 
sults presented in the paper will be compared to those found 
in Farrens et al. (in prep), which will examine galaxy clus- 
ters in the 2SLAQ catalogue using photometric redshifts. 
This comparison will test the reliability of the DFoF code 
to detect structures using photometric data. 
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APPENDIX A: CLUSTER CATALOGUE 

The catalogue contains 313 clusters with the following prop- 
erties: identifier, number of galaxy members (or richness), 
right ascension, declination, redshift, velocity dispersion (in 
kms~^), radial size (in Mpc h~^), estimated mass (in logio 
M0) and sample to which it belongs (G: gold ; S: silver). A 
sample of the first 10 clusters is shown in table- IXT] 
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Table Al. Cluster Catalogue Sample 



No 


ID 




RA 


Dec 


z 


cTj, (kms 


Km (Mpc h-^) 


logio Mest (Mq) 


Sample 


1 


CL.0008 


Oil 


152.85693 


-0.00323 


0.51165 


0562.51 


02.08 


14.3051 


G 


2 


CL.0031 


010 


210.59379 


-0.55691 


0.44007 


1178.60 


02.00 


15.2916 


G 


3 


CL.0060 


009 


035.42705 


-0.22677 


0.59094 


0784.58 


01.74 


14.7136 


G 


4 


CL_0140 


008 


342.39005 


-0.44521 


0.49785 


0846.21 


01.27 


14.8415 


G 


5 


CL_0204 


007 


158.86053 


-0.02893 


0.48510 


1108.13 


00.65 


15.1969 


G 


6 


CL.0038 


007 


191.56150 


-0.51722 


0.48679 


0511.13 


01.66 


14.1882 


G 


7 


CL.0051 


007 


190.76950 


-1-0.03930 


0.46779 


0786.90 


01.26 


14.7564 


G 


8 


CL_0122 


007 


161.24470 


-0.30396 


0.62371 


0851.02 


01.85 


14.8094 


G 


9 


CL.0064 


006 


202.19576 


-0.35692 


0.52837 


0381.83 


00.73 


13.7950 


G 


10 


CL_0024 


006 


343.37305 


-0.49057 


0.56982 


0591.89 


01.52 


14.3531 


G 



